NSF PAR Search | NSF Public Access Repository



Assessing the potential of GPT-4 to perpetuate racial and gender biases in health care: a model evaluation study

https://doi.org/10.1016/S2589-7500(23)00225-X

Zack, Travis; Lehman, Eric; Suzgun, Mirac; Rodriguez, Jorge A; Celi, Leo Anthony; Gichoya, Judy; Jurafsky, Dan; Szolovits, Peter; Bates, David W; Abdulnour, Raja-Elie E; et al (January 2024, The Lancet Digital Health)

Full Text Available
Embedding electronic health records onto a knowledge network recognizes prodromal features of multiple sclerosis and predicts diagnosis

https://doi.org/10.1093/jamia/ocab270

Nelson, Charlotte A; Bove, Riley; Butte, Atul J; Baranzini, Sergio E (December 2021, Journal of the American Medical Informatics Association)

Objective: Early identification of chronic diseases is a pillar of precision medicine as it can lead to improved outcomes, reduction of disease burden, and lower healthcare costs. Predictions of a patient’s health trajectory have been improved through the application of machine learning approaches to electronic health records (EHRs). However, these methods have traditionally relied on “black box” algorithms that can process large amounts of data but are unable to incorporate domain knowledge, thus limiting their predictive and explanatory power. Here, we present a method for incorporating domain knowledge into clinical classifications by embedding individual patient data into a biomedical knowledge graph. Materials and Methods: A modified version of the Page rank algorithm was implemented to embed millions of deidentified EHRs into a biomedical knowledge graph (SPOKE). This resulted in high-dimensional, knowledge-guided patient health signatures (ie, SPOKEsigs) that were subsequently used as features in a random forest environment to classify patients at risk of developing a chronic disease. Results: Our model predicted disease status of 5752 subjects 3 years before being diagnosed with multiple sclerosis (MS) (AUC = 0.83). SPOKEsigs outperformed predictions using EHRs alone, and the biological drivers of the classifiers provided insight into the underpinnings of prodromal MS. Conclusion: Using data from EHR as input, SPOKEsigs describe patients at both the clinical and biological levels. We provide a clinical use case for detecting MS up to 5 years prior to their documented diagnosis in the clinic and illustrate the biological features that distinguish the prodromal MS state.
Full Text Available
Predictability and stability testing to assess clinical decision instrument performance for children after blunt torso trauma

https://doi.org/10.1371/journal.pdig.0000076

Kornblith, Aaron E.; Singh, Chandan; Devlin, Gabriel; Addo, Newton; Streck, Christian J.; Holmes, James F.; Kuppermann, Nathan; Grupp-Phelan, Jacqueline; Fineman, Jeffrey; Butte, Atul J.; et al (August 2022, PLOS Digital Health)
Li-Jessen, Nicole Yee-Key (Ed.)
Objective: The Pediatric Emergency Care Applied Research Network (PECARN) has developed a clinical-decision instrument (CDI) to identify children at very low risk of intra-abdominal injury. However, the CDI has not been externally validated. We sought to vet the PECARN CDI with the Predictability Computability Stability (PCS) data science framework, potentially increasing its chance of a successful external validation. Materials & methods: We performed a secondary analysis of two prospectively collected datasets: PECARN (12,044 children from 20 emergency departments) and an independent external validation dataset from the Pediatric Surgical Research Collaborative (PedSRC; 2,188 children from 14 emergency departments). We used PCS to reanalyze the original PECARN CDI along with new interpretable PCS CDIs developed using the PECARN dataset. External validation was then measured on the PedSRC dataset. Results: Three predictor variables (abdominal wall trauma, Glasgow Coma Scale Score <14, and abdominal tenderness) were found to be stable. A CDI using only these three variables would achieve lower sensitivity than the original PECARN CDI with seven variables on internal PECARN validation but achieve the same performance on external PedSRC validation (sensitivity 96.8% and specificity 44%). Using only these variables, we developed a PCS CDI which had a lower sensitivity than the original PECARN CDI on internal PECARN validation but performed the same on external PedSRC validation (sensitivity 96.8% and specificity 44%). Conclusion: The PCS data science framework vetted the PECARN CDI and its constituent predictor variables prior to external validation. We found that the 3 stable predictor variables represented all of the PECARN CDI’s predictive performance on independent external validation. The PCS framework offers a less resource-intensive method than prospective validation to vet CDIs before external validation.
We also found that the PECARN CDI will generalize well to new populations and should be prospectively externally validated. The PCS framework offers a potential strategy to increase the chance of a successful (costly) prospective validation.
Full Text Available
Knowledge Network Embedding of Transcriptomic Data from Spaceflown Mice Uncovers Signs and Symptoms Associated with Terrestrial Diseases

https://doi.org/10.3390/life11010042

Nelson, Charlotte A.; Acuna, Ana Uriarte; Paul, Amber M.; Scott, Ryan T.; Butte, Atul J.; Cekanaviciute, Egle; Baranzini, Sergio E.; Costes, Sylvain V. (January 2021, Life)

There has long been an interest in understanding how the hazards from spaceflight may trigger or exacerbate human diseases. With the goal of advancing our knowledge on physiological changes during space travel, NASA GeneLab provides an open-source repository of multi-omics data from real and simulated spaceflight studies. Alone, this data enables identification of biological changes during spaceflight, but cannot infer how that may impact an astronaut at the phenotypic level. To bridge this gap, Scalable Precision Medicine Oriented Knowledge Engine (SPOKE), a heterogeneous knowledge graph connecting biological and clinical data from over 30 databases, was used in combination with GeneLab transcriptomic data from six studies. This integration identified critical symptoms and physiological changes incurred during spaceflight.
Full Text Available
Recent Advances in Systems and Network Medicine: Meeting Report from the First International Conference in Systems and Network Medicine

https://doi.org/10.1089/sysm.2020.0001

Kurnat-Thoma, Emma; Baranova, Ancha; Baird, Pat; Brodsky, Elia; Butte, Atul J.; Cheema, Amrita K.; Cheng, Feixiong; Dutta, Shuchismita; Grant, Christina; Giordano, James; et al (January 2020, Systems Medicine)
Full Text Available
